Numerical performance of penalized comparison to overfitting for multivariate kernel density estimation
نویسندگان
چکیده
Kernel density estimation is a well known method involving smoothing parameter (the bandwidth) that needs to be tuned by the user. Although this has been widely used, bandwidth selection remains challenging issue in terms of balancing algorithmic performance and statistical relevance. The purpose paper study recently developed method, called Penalized Comparison Overfitting (PCO). We first provide new theoretical guarantees proving PCO performed with non-diagonal matrices optimal oracle minimax approaches. then compared other usual methods (at least those which are implemented R-package) for univariate also multivariate kernel on basis intensive simulation studies. In particular, cross-validation plug-in criteria numerically investigated PCO. take home message can outperform classical without additional cost.
منابع مشابه
Feature significance for multivariate kernel density estimation
Multivariate kernel density estimation provides information about structure in data. Feature significance is a technique for deciding whether features – such as local extrema – are statistically significant. This paper proposes a framework for feature significance in d-dimensional data which combines kernel density derivative estimators and hypothesis tests for modal regions. For the gradient a...
متن کاملBandwidth Selection for Multivariate Kernel Density Estimation Using MCMC
Kernel density estimation for multivariate data is an important technique that has a wide range of applications in econometrics and finance. However, it has received significantly less attention than its univariate counterpart. The lower level of interest in multivariate kernel density estimation is mainly due to the increased difficulty in deriving an optimal datadriven bandwidth as the dimens...
متن کاملPenalized semiparametric density estimation
In this article we propose a penalized likelihood approach for the semiparametric density model with parametric and nonparametric components. An efficient iterative procedure is proposed for estimation. Approximate generalized maximum likelihood criterion from Bayesian point of view is derived for selecting the smoothing parameter. The finite sample performance of the proposed estimation approa...
متن کاملApproximate inference of the bandwidth in multivariate kernel density estimation
Kernel density estimation is a popular and widely used non-parametric method for data-driven density estimation. Its appeal lies in its simplicity and ease of implementation, as well as its strong asymptotic results regarding its convergence to the true data distribution. However, a major difficulty is the setting of the bandwidth, particularly in high dimensions and with limited amount of data...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: Esaim: Probability and Statistics
سال: 2023
ISSN: ['1292-8100', '1262-3318']
DOI: https://doi.org/10.1051/ps/2022018